Using Web-Based Corpora to Find Norwegian Specialised Neologies
Similar resources
Vocabulary-Based Language Similarity using Web Corpora
This paper will focus on automatic methods for quantifying language similarity. This is achieved by ascribing language similarity to the similarity of text corpora. This corpus similarity will first be determined by the resemblance of the vocabulary of languages. Thereto words or parts of them such as letter n-grams are examined. Extensions like transliteration of the text data will ensure the ...
Using collocations from comparable corpora to find translation equivalents
In this paper we present a tool for finding appropriate translation equivalents for words from the general lexicon using comparable corpora. For a phrase in the source language the tool suggests a range of possible expressions used in similar contexts in target language corpora. In the paper we discuss the method and present results of human evaluation of the performance of the tool.
Harnessing the lawless: using comparable corpora to find translation equivalents
Bilingual dictionaries provide basic translation equivalents for a headword and typically limit the set of equivalents to words of the same part of speech as the headword. However, words taken in their contexts can be translated in many more ways. At the same time, equivalents listed in dictionaries are not adequate in many contexts, because of the contextual and collocational sensitivity of ta...
Using the Web to Interactively Learn to Find Objects
In order for robots to intelligently perform tasks with humans, they must be able to access a broad set of background knowledge about the environments in which they operate. Unlike other approaches, which tend to manually define the knowledge of the robot, our approach enables robots to actively query the World Wide Web (WWW) to learn background knowledge about the physical environment. We show...
Using the Web to Create Minority Language Corpora
The Web is a valuable source of language specific resources but the process of collecting, organizing and utilizing these resources is difficult. We describe CorpusBuilder, an approach for automatically generating Web-search queries for collecting documents in a minority language. It differs from pseudo-relevance feedback in that retrieved documents are labeled by an automatic language cla...
Journal
Journal title: Communication & Language at Work
Year: 2012
ISSN: 2245-5744
DOI: 10.7146/claw.v1i1.7235